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Abstract 

We bound the hereditary discrepancy of a hypergraph 7i in two colors in 
terms of its hereditary discrepancy in c colors. We show that herdisc(7i, 2) < 
i^cherdisc(7^, c), where K is some absolute constant. This bound is sharp. 



1 Introduction 

Discrepancy theory examines uniformity problems of combinatorial and geometric 
objects as well as sequences of numbers. 

The discrepancy problem for hypergraphs is to color the vertices of a hypergraph with 
fixed number of colors such that each hyperedge contains about the same number 
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of vertices in each color. This problem is the subject of a number of famous re- 
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BF81| . and Spencer's 'six standard deviations' 



Surprisingly, almost all research so far regarded hypergraph discrepancies for two 
colors only. It was only recently that the gener al case was formally introduced 
and investigated by Srivastav and the first author DSOSj]. Their results revealed a 
remarkable peculiarity. Although many of the classical results could be generalized 
to the multi-color case, it turned out that there is no general relationship between 
the discre pancies of a hypergraph in different numbers of colors. In particular, it was 
shown in Doe02| that there exist families of hypergraphs with zero discrepancy in 
certain numbers of colors and high discrepancy in all others. 



This dichotomy could be solved 



m 



DoeOJ]. There it was shown that the hereditary discrepancy, which is the maxi- 
mum discrepancy among the induced subhypergraphs, is almost independent of the 
number of colors. In this paper, we further sharpen this relation to bounds that are 
tight up to constant factors (independent of the numbers of colors and the size of 
the hypergraph). 



2 Combinatorial Discrepancy Theory 



In this section, we give a short introduction to discrepancies of hypergraphs and 
matrices. The interested reade r is invited to consult the nice survey on discrepancy 
theory by Beck and Sos BS95[ | for further information. 



2.1 Combinatorial Discrepancies of Hypergraphs 



A (finite) hypergraph Ti is an ordered pair (V, £) where K is a (finite) set of vertices 
and £ C 2^ is a set of (hyper)edges. In the following, we restrict ourselves to finite 
hypergraphs without explicit notice. We also stick to the conventions to denote the 
number of vertices by n and the number of edges by m. 

Let c > 2. A c-coloring of Ti. is simply a mapping x • ^ ^) where M is some 
set of cardinality c. Typically, we have M = [c] := {1, . . . , c}. The hypergraph 7i 
is ideally colored by x if each color d occurs equally often in each hyperedge, that 
is, if \x~^(d) f] E\ — ^ \E\ for all d e [c] and E e £. Hence, we define the c-color 
discrepancy ofH with respect to x by 

disc(7i, X, c) := max | \x-\d) nE\-l\E\\. 

d£[c],E&£ ' 

The c-color discrepancy ofH. is now given by 



disc(7Y, c) min disc(7Y, x, c). 

To get some intuition for this concept, let us regard the following hypergraph Ti^ 
Let A, B be disjoint sets of cardinality n and V. — {AliB,£) with £ — {E <Z AUB 

\EnA\ = \EnB\}. 

Fact 2.1. For all n the discrepancy of Tin satisfies the following. 

(a) disc(?in,2) = 0. 

(b) For all c> 2, disc(7i„, c) > ^. 
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In particular, we see that a hypergraph may have very different discrepancies in 
different numbers of colors. As we will see, this is not possible for the hereditary 
discrepancy. 

Let Vq V he a subset of the vertices. Then TC\Vo '■— i^o, {E nVo\EE S}) is 
called induced subhypergraph of Ti. The c-color hereditary discrepancy of 7i is the 
maximum c-color discrepancy among the induced subhyper graphs: 



herdisc('H, c) = max disc (Hi vb, c). 



Again, we roughly estimate the hereditary discrepancy of 7i„ in different number of 
colors. 

Fact 2.2. For all neN, 

^ < herdisc(H, c) < ^. 



That the hereditary discr epancie s are similar in different numbers of colors, was 
proven by the first author Doe04 |. 



Theorem 2.3. For all hypergraphs 7i and all numbers a,b & N>2 of colors, 



herdisc(7i, b) < a^{b — 1) herdisc(7i, a). 



In Section [3], we sharpen this bound to herdisc(7Y, 6) < fCa herdisc(?i, a) for an 
absolute constant K. The linear dependence on a is necessary, as shown in Section |H 
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2.2 Discrepancies of Matrices 



We can describe a hypergraph via its incidence matrix. Let Vi, . . . ,Vn & V he the 
vertices and Ei, . . . , Em & S he the edges of a hypergraph V.. Then the incidence 
matrix with respect to these orderings of V and £ is the m x n matrix A — {uij) 
with 



11 if Vj e Ei , 
else. 

For a given c-coloring x : y ^ [c] of a hypergraph, we define an associated c-coloring 
of its incidence matrix as a mapping p : [n] ^ [c] with p{i) :— x(i'i) for all i e [n]. 
The following definitions extend the discrepancy notions to matrices in the sense 
that the discrepancies of a hypergraph and its incidence matrix are the same. For 
arbitrary matrix A e and c-coloring p : [n] — > [c] , we define 



disc{A,p, c) :- 

disc(74, c) := 
herdisc(A, c) : = 



max 

d6[c],je[m] 



min disc(74,p, c), 

p:[n]-^[c] 

max disc (An, c). 

AoQA ^ ' 



Here we write C A to denote that Aq is a submatrix oi A. In the following, we 
will mainly work with the more general concept of matrix discrepancies. 
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2.3 Weighted Discrepancies 



The discrepancy notions introduced above all refer to the problem of partitioning 
into equal sized parts. Occasionally, and in particular in this paper, a more general 
concept is helpful. In the language of hypergraphs, the notion of weighted discrep- 
ancy refers to the problem of coloring the vertices in such a way that each hyperedge 
has a certain fraction of its vertices in each color class. 

Fortunately, we need such weighted discrepancies only for two colors. Here the 
problem is slightly simpler as we only have to regard one color. Note that an excess 
of vertices in one color in some hyperedge yields the same number of vertices missing 
in the other color. 

In the matrix language, we define the following. A mapping p : [n] ^ [0, 1] shall be 
called floating coloring. For two floating colorings p and q, we define their discrepancy 
by 

^ aijipij) - qU)) ■ 
je[n] 



dA{p,q) := ||A(p,g)||oo = max 



Let z e [0, 1]. Denote by In the unique mapping [n] — > {1}. Then 



wdisc{A,z):= min dA{zln,q), 

q:[n]-*{0,l} 

wdiscf^, 2) := max wdiscfA, z), 
2e[o,i] 

herwdiscfA, 2) := max wdiscfAo, 2). 

AqCA 



Having introduced the (hereditary) weighted discrepancy, we recognize the 'usual' 
discrepancy notions as a special case. 



6 



Fact 2.4. For all matrices A, we have 



disc(A, 2) = wdisc(y4, |), 
herdisc(74, 2) = herwdisc(y4, i). 

Surprisingly, the weighted discrepancy problem is not much harder than the afore- 
mentioned sp ecial c ase. The following theorem follows from a famo us resul t by Beck 



and Spencer |BS84| as well as Lovasz, Spencer, and Vesztergombi |LSV8' 
Theorem 2.5. For all matrices A, we have 

herwdisc(yl, 2) < 2 herdisc(y4, 2). 



The additional strength of the notion of weighted discrepancies is visible e.g. in the 
following result, which we will later also need. 



Theorem 2.6 (Doerr, Srivastav DSOSj). For an arbitrary matrix A G M™^"™ and 
any c E N>2, 

herdisc(A,c) < 2.0005 herwdisc(A, 2). 

3 Discrepancies in Different Numbers of Colors 

In this section, we improve Theorem 12.31 by proving the following bound. 
Theorem 3.1. For any matrix A G M*"^" and arbitrary a,b E N>2, we have 



herdisc(A, 6) < Ka herdisc(A, a), 



where K is some absolute constant less than 2.0005. 

For the proof, we will take a detour through two colors. Since by Theorem 12. 6[ the 
hereditary discrepancy in h colors is at most of the order of the hereditary weighted 
discrepancy in two colors, it remains to bound the hereditary weighted discrepancy 
in two colors by the a-color hereditary discrepancy. Hence, the main part of this 
paper is to prove the following theorem. 

Theorem 3.2. For any matrix A G M™^" and arbitrary c G N>2, we have 



The proof of Theorem 13.11 now is an easy corollary. 

Proof of Theorem \3.1[ We first bound the hereditary discrepancy in h colors by the 
2-color weighted hereditary discrepancy and afterwards bound the latter by the 
hereditary discrepancy in a colors. Hence, we get 



herwdisc(A, 2) < c herdisc(y4, c). 



herdisc(A,6) < 2.0005 herwdisc(A, 2) 
< 2.0005a herdisc(A, a) 



by Theorem 12.61 



by Theorem 13.21 



□ 
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3.1 A Simple Case 



Both to give some more insight and to use it later, let us first analyze the case where 
a is a multiple of b. 

Lemma 3.3. Let a = k ■ b for some k E N. Then 

herdisc(A, 6) < k herdisc(A, a). 

Proof. Let Aq G M™o^"" be an arbitrary submatrix of A. Choose an a-coloring 
p : [^o] [a] with disc{Ao,p,a) < herdisc(74, a). We will define a 6-coloring q : 
[^o] ^ [b] by clubbing together k color classes of p respectively, i.e., 

q{i) = p{i) mod b. 

We obtain disc(y4o,&) < disc(74o, g, 6) < k herdisc(y4, a). Since we chose Aq arbitrar- 
ily, the claim follows. 

□ 

If a is not a multiple of 6, we cannot evenly combine color classes as done above. 
We could apply Lemma [3.31 only using the 6[|J largest color classes and recursively 
repeat this procedure on the vertices that were not colored so far. 

However this recursive approach might take logj^y(-^_i, |^ajj(no) < log2(no) iterations, 
yielding a bound with logarithmic dependence on the number of vertices. 
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3.2 The Main Result 



In this section, we bound the 2-color weighted hereditary discrepancy in terms of 
the hereditary discrepancy in c colors, that is, we prove Theorem I3.2[ 

For c even, this can be deduced from Theorem 12.51 and Lemma 13.31 since 

herwdisc(y4, 2) < 2 herdisc(yl, 2) by Theorem 12.51 

< cherdisc(A, c) by Lemma 13.31 

Hence, the crucial case is that c is odd. We show that any constant floating coloring 
p := zln, where z has finite c-ary expansion of length i, can be 'rounded' to a floating 
coloring q having c-ary length at most i — 1 and dA{p,q) = 0(c~^"*"^). Removing 
the obstacle that q is not necessarily a constant floating coloring, we can iterate this 
rounding procedure until we obtain a true coloring. 

The following lemma describes one iteration of the rounding process. If some number 
X can be written as a; = Yll=o '^jC"* with £ G N and oq, . . . , G {0, 1, . . . , c — 1}, 
we say that x has c-ary expansion of length i. By \x\^ we denote the length of a 
shortest c-ary expansion of x. We set = cxd, if a; has no finite c-ary expansion. 
For £ G No and C N, we define 



M,,, := {x G [0, 1] I \xl < £} , 
C5 ■■= {P\P--N^M,,^}. 

Lemma 3.4. Let c be odd, t G M^/ and p G C)."J with p{i) = t for all i G [n]. Then 
there exists a p' & such that 

dA{p,p) < |(c- l)c-^+^herdisc(Ac). 
10 



Proof. Let t = Ylk=o^kC ^ for some tk E {0, . . . , c — 1} denote the c-ary expansion 
of till and [t\f_i := Ylk^o^kC"'' G Mc/-i be its rounding of c-ary length i — 1. 



Choose a c-coforing g : [n] — > [c] such that disc(y4, g, c) < herdisc {A,c). Denote by 
Ji := {j G [n] I = i} for i G [c] the partition classes defined by q. Let J[t^] : = 
Ufc=i Jk be the union of the first t£ partition classes. For all j G [n] put 



p'U) ■-- 



[t\e-i + c-'^' ifj GJ[,,], 
[t\e-i else. 



Note that G Mc,£_i for all j G [n]. Hence p' G 



For all i G [m], we compute 



^Remember that this expansion of t is unique. 



11 



J6M 



^ aij (c hi - c ^+^) + ^ aijC Hi 

< c^^^^ min{t£, c — t^} herdisc(A, c 

< i(c- l)c-^+^herdisc(A,c) 



(1) 
(2) 



Here (1) follows from the fact that 



J2 ( Z ^^^^ ' - Z = Z ( Z ^^^^ ' - Z 



For (2), we note that c is odd. 



□ 



While we assumed all values of p to be equal, its rounding p' may take two different 
values. To apply Lemma [3.41 iteratively. we generalize our result for that purpose. 
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Lemma 3.5. Let c be odd and p G C^J taking t different values. Then there exists 
a q & C^^l_i such that 



dA {p, g) < |(c - l)c"^+Vhcrdisc {A, c) 



Proof. We apply Lemma 13.41 on subinstances of the rounding problem where p is 
constant. Let t^^\ . . . , t^'^^ G Mc/ be an enumeration of the different values of p. For 
all G [r], define J'k ■= {j G [n] \ p(j) = t^'^-'}. Writing Aj for the j-th column of A, 
we denote by A^j^^ the submatrix consisting of the columns Aj of A with j G 

Now p\j^ is a constant floating coloring for A\j^. Apply Lemma [231 on A\j^ and p\ji^ 
to obtain a floating coloring q^^^ G Cfg^i with 



dA,^^ipij,,q^'^) < |(c- l)c-^+^herdisc(A|^,,c) 
< |(c - l)c-^+^ herdisc {A, c) . 



Now let q be the union of the g*^^\ that is, q G C'^.'^Li such that q{j) = q^^\j) for all 
j G jTfc and /c G [r]. 

Since the J7fc form a partition of [n] , we have 



^We adopt the formal view that a matrix A G is a function A : [to] x [n] — > M. Hence, the 

submatrix A\j is simply the restriction A\j : [m] x {hi) ^ A{i,i) for all i £ [to] and j e 
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d-A {p, q) = max 
je[m] 



fce[r] ieJfc 



< V max V aij {p{j) - q{j)) 

< i(c- l)c"^+Vherdisc(A,c). 



□ 



Starting with a floating coloring p G C^"^, we iteratively apply the lemma above i 
times to get a true coloring q. The crucial observation now is that if our initial 
floating coloring p is constant, then all colorings constructed by applying Lemma [331 
take at most two different values. In other words, we always have r < 2 when 
invoking Lemma [3.51 

Lemma 3.6. Let c be odd, t E M^/ and p = tin. Denote by g'-*-' the vector obtained 
after the i-th iteration of applying Lemma \3. 51 on p. Then the number of different 
values of q^'^^ is at most 2 for all i G {0, ...,£} . 



Proof. We distinguish two states of the rounding algorithm. In state 1, there are at 
most two different values of the current vector g^*) and all values differ only in the 
last digit, that is, g*-'^ takes at most two values t*-^-* and t*-^-* such that [^'•"'^^J = 
|_^*"^''J^_j_i- state 2, the current vector g*^*^ takes two values t^^^ and t*-^-* of different 
c-ary length such that t'^^^ = t*-^-* + c^'+*, = £ — i — a and li*-^-* |^ = i — i ioi some 

o G [/ — i]. Furthermore, if t^^-* = Xlfc=o '^i^'*'^"'^ denotes the c-ary expansion of jt*^^-*, 
thent£_, = c-lforallA;G{0,...,o-l} and [t«J^_._^_^ = [t^^^ J, We 
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show that the algorithm is always in one of these two states and in particular that 
no other state is reached. 



Clearly, the algorithm starts in state 1. 

Assume that after iteration i the algorithm is in state 1 and let t^^^ and t^^) be 
defined as above for state 1, where t'^^^ = Ylk^o^^k^'^~^ denotes the c-ary expansion 
of t^^\ By applying the procudure described in Lemma 13.51 we get q''^^^\j) = 
[t(i)J^_.__^ + 0"^^+*+^ or = for all j G [n]. Note that we have 

exploited the fact that [t^^^J^_._i = [t^^^i^,_r We now distinguish two cases. 
If |_^*"^''J^_j_;^ + c^^^^'^^ results in no carry over, that is, if ^l^^i ^ c—1, then we have 
\ g_^_2 — \_^^^^\e-i-2 ^ ^ t'^]' '^^^^5 iteration i + 1 we remain in state 

1. 

Otherwise, let o G i — 1] be the largest index such that te-i-i-o 7^ c— 1, and hence. 



tf}.^^_^ = c-1 for all ke{0,.. . ,o-l}. We have [t^^^J^_._i + c 



l—i — l—o 



and 



[t^'^\,^^-i =l-^-l^s well as [t^'^ = [t^'^ e-^-i-o-v Thus, 



m 



iteration z + 1 we are in state 2. 



Assume now that after iteration i the algorithm is in state 2 and let t*^^-* and t'-^-* 
be defined as above for state 2. Note that we have = t(^) for all j G [n] with 

= ^(1), Hence, we get either 

= t(2)_(c_l)c-^+^ + C-^+^+l=tW 

or = [t(')J,_,_i for all j G [n]. If o = 1, [t^'^,_,_, = [t'^'^,_,_,- Hence, in 

iteration z + 1 we are in state 1. If o > 1, we remain in state 2. 

Since in both possible states the number of different values of g^*-* is at most 2 for all 
z G {0, . . . , £}, the claim follows. □ 
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We can now prove Theorem 13.21 



Proof of Theorem \3.S[ Remember that for c even the theorem follows directly from 
Theorem 12.51 and Lemma 13.31 Hence we assume that c is odd for the rest of the 
proof. 

Let z E [0, 1] have c-ary length £ G N and p = zln- We iteratively round q^'^^ := p 
to colorings g^*^ such that after each iteration i E [£], the c-ary length of q^^\j) is at 
most £ — i, while the rounding error g^*)) remains small. At the end of this 

process, we obtain a 'pure' c-coloring q := q'^^\ 

The sequence )jG[£] is defined in the obvious way. Remember q^^^ = p E C^^\- 
Having defined g'-*-' E C^^\_^ for i < £, we apply Lemma 13.51 on g*^*) to get g(*+^) e 
C^c\-i-\ s^*^^ ^h-oX dA {q^^\ g(*+^)) < |c"^+*+^rj herdisc (A, c) with Tj being the number 
of different values of q^'^\ By Lemma [3.61 we have < 2 for all i E [£]. 

We compute 

£-1 

i=0 
£-1 

< ^ i(c - l)c-^+'+Viherdisc(/l, c) 

< c herdisc (A, c) 

Notice that we have restricted ourselves to z G [0, 1] having finite c-ary length. 
Since z i— > herwdisc(y4, is continuous and the set of numbers having finite c-ary 
expansion is dense in [0, 1], our bound carries over to arbitrary z. 

□ 
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4 A Tight Example 



We now construct an infinite family of liypergrapfis for wliicli tfie bound of Theorem 
13.21 is tight apart from constant factors. 

Let c G N>2, k G A^>2 and n := 2ck. We consider the complete hypergraph 
T-C = {[n], 2'"!) on n vertices. Note that for complete hypergraphs, all induced subhy- 
pergraphs have a discrepancy of at most the discrepancy of the complete hypergraph. 
Hence, the hereditary discrepancy equals the discrepancy of H. 

As is easily seen, a c-coloring that partitions the vertex set in c classes of equal sizes is 
optimal. The discrepancy is now witnessed, e.g., by monochromatic edges of size n/c, 
implying herdisc(H, c) = (1 — ^)n/c. On the other hand, we have herdisc(7i, 2) = 
n/4. Hence, we have herwdisc(7-^, 2) > herdisc(?i, 2) > her disc ( 7i, c). 



5 Summary and Outlook 



In [DoeOJ], it was shown that the hereditary discrepancies of a hypergraph in different 
numbers of colors differ by at most constant factors, which only depend on the 
number of colors involved. 

We improved this result by showing that we can bound the hereditary discrepancy 
in b colors by 0{a) times the hereditary discrepancy in a colors. Furthermore, such 
linear dependencies on the number of colors actually occur. 

This work raises a number of open questions. A natural one is a determination of 
the absolute constant involved. Theorem 13.21 shows the bound herwdisc(y4, 2) < 
cherdisc(y4, c). In Section HI a class of hypergraphs fulfilling herwdisc(7i, 2) > 
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|cherdisc(?i, c) was presented. We currently do not know if the absolute constant 
of 1 in Theorem 13.21 can be improved towards ^ . 

Possibly more interesting is the dependence on the second number of colors involved. 
Since we did a detour through 2 colors, we only obtained the bound herdisc(y4, b) = 
0{a) herdisc(74, a). However, all examples we know only witness a bound of herdisc(y4, b) 
0{a/b) herdisc(A, a). To show such a bound, it would be necessary to avoid the de- 
tour through 2 colors, since the bound herdisc(A,6) = 0(1) herdisc(A, 2) cannot be 
improved. Avoiding the detour would also be of independent interest, as would a 
direct proof avoiding the rounding approach. 

Another issue is the role of the linear discrepancy. The 2-color linear discrepancy of 
a matrix A is 



^aij{p{i) - q{])) . 

Hence the linear discrepancy problem is an extensi on of the weighted one where each 



lindiscfA, 2) := max min max 

p:[n]-»[0,l] g:[n]^{0,l} i&[m] 



vertex/column has an individual weight. In Doe04| . lindiscfA. 2) = O(c^) herdisc (A, c) 
was shown. It seems to be an interesting problem whether the linear discrepancy 
problem is harder than the weighted one in that we need this quadratic dependence 
or not. Again, we have no clue. 
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